Incremental Multi-Step

Authors

  • JING PENG
  • RONALD J. WILLIAMS
Abstract

This paper presents a novel incremental algorithm that combines Q-learning, a well-known dynamic programming-based reinforcement learning method, with the TD(λ) return estimation process, which is typically used in actor-critic learning, another well-known dynamic programming-based reinforcement learning method. The parameter λ is used to distribute credit throughout sequences of actions, leading to faster learning and also helping to alleviate the non-Markovian effect of coarse state-space quantization. The resulting algorithm, Q(λ)-learning, thus combines some of the best features of the Q-learning and actor-critic learning paradigms. The behavior of this algorithm has been demonstrated through computer simulations.
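The abstract does not state the update rule, so the following is only a minimal tabular sketch of one common way to combine Q-learning with TD(λ)-style eligibility traces. The function name `q_lambda_step`, the nested-list tables, and the default values of `alpha`, `gamma`, and `lam` are assumptions made for illustration, not details taken from the paper.

```python
def q_lambda_step(Q, trace, s, a, r, s_next, done,
                  alpha=0.1, gamma=0.9, lam=0.8):
    """Apply one incremental update in place (illustrative sketch only).

    Q and trace are lists indexed as [state][action]. All names and
    default parameter values here are hypothetical.
    """
    # Greedy value estimates of the next and the current state.
    v_next = 0.0 if done else max(Q[s_next])
    v_curr = max(Q[s])

    # One-step error for the pair that was just tried.
    e_action = r + gamma * v_next - Q[s][a]
    # Error measured against the greedy value of the current state;
    # this is the quantity pushed back along the trace.
    e_state = r + gamma * v_next - v_curr

    # Decay every trace by gamma * lam, then credit earlier pairs in
    # proportion to their decayed trace.
    for i in range(len(Q)):
        for j in range(len(Q[i])):
            trace[i][j] *= gamma * lam
            Q[i][j] += alpha * trace[i][j] * e_state

    # Ordinary one-step correction for the current pair, then mark it
    # as eligible for future credit.
    Q[s][a] += alpha * e_action
    trace[s][a] += 1.0


# Usage: two states, two actions, a reward of 1 on the second step.
Q = [[0.0, 0.0], [0.0, 0.0]]
trace = [[0.0, 0.0], [0.0, 0.0]]
q_lambda_step(Q, trace, s=0, a=1, r=0.0, s_next=1, done=False)
q_lambda_step(Q, trace, s=1, a=0, r=1.0, s_next=0, done=True)
# Q[1][0] has moved toward the reward, and Q[0][1] also received a
# smaller share of the credit through its decayed trace.
```

With `lam` set to 0 the trace term vanishes and each step reduces to a plain one-step Q-learning update, which is one way to see how λ controls how far credit is spread back along a sequence of actions.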





Publication date: 1996